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Using each node's degree as a proxy for its importance, the topological hierarchy of a complex 
network is introduced and quantified. We propose a simple dynamical process used to construct 
networks which are either maximally or minimally hierarchical. Comparison with these extremal 
cases as well as with random scale-free networks allows us to better understand hierarchical versus 
modular features in several real-life complex networks. For random scale-free topologies the extent 
of topological hierarchy is shown to smoothly decline with 7 - the exponent of a degree distribution 
- reaching its highest possible value for 7 < 2 and quickly approaching zero for 7 > 3. 
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Networks recently came to the focus of attention of the 
complex systems research. Indeed, most complex systems 
have an underlying network serving as a "backbone" for 
its dynamical processes. The large-scale topological or- 
ganization of a particular complex network is related to 
both its functional role and historical background. Thus 
it is important to develop quantitative tools allowing one 
to detect and measure significant features in the topology 
of a given network. 

The hierarchical organization is a common feature of 
many complex systems. As an example of a hierarchy 
one might think of the organizational structure of a large 
company. The defining feature of a hierarchical organi- 
zation is the existence of a hierarchical path connecting 
any two of its nodes. It can be thought of as a trajectory 
of a request initiated at one of the nodes and reaching its 
destination node through the "chain of command" . Such 
a request first goes up the steps of the hierarchy until it 
reaches the first common boss of its sender and recipient 
after which it descend the hierarchical levels down to the 
destination node. In most real-life complex systems the 
simple tree-like hierarchical network may be augmented 
by "shortcuts" bypassing the chain of command. Such 
non-hierarchical shortcuts make the task of detecting the 
hierarchical structure much more non-trivial. The ques- 
tion we want to address in this work is how to detect 
and measure the extent of the hierarchy manifested in 
the topology of a given complex network. 

Real world networks often have very broad degree dis- 
tribution and thus the degree in itself gives a sensi- 
ble characteristic of a node which has to be preserved in 
any randomized version of a network 0- 13 In fact, in 
a number of systems nodes with higher degrees are on 
average more important than their lower degree counter- 
parts. For example, for the Internet the number of hard- 
wired connections a given Autonomous System serves as 
a proxy of its importance with the most connected nodes 
being global Internet Service Providers. For WWW the 



in-degree of a web page can serve as a measure of its 
popularity and hence importance; highly connected hub- 
airports of airline networks typically located in large 
cities, etc. In what follows we use the degree of a node as 
a proxy for its rank in the hierarchy based on the relative 
importance of nodes. Thus we propose a way to couple a 
local topological quantity, the degree, to a global struc- 
ture, hierarchy: Thereby we define and quantify topolog- 
ical hierarchy as a way to characterize networks beyond 
their degree distribution and its two point correlation 
function USE]- However, we would like to point out 
that our methods could be generalized to hierarchies de- 
fined in terms of any other characteristic of individual 
nodes, being it wealth, mass, or some other appropriate 
quantity. 

We quantify the hierarchical topologypf a network us- 
ing the concept of a hierarchical path @ : a path be- 
tween two nodes in a network is called hierarchical if it 
consists of an "up path" - where one is allowed to step 
from node i to node j only if their degrees fej , kj satisfy 
fej < kj followed by a "down path" - where only steps to 
nodes of lower or equal degree are allowed. Either up or 
down path is allowed to have zero length. This definition 
of a hierarchical path follows the above mentioned tra- 
jectory of a request which is first forwarded up and then 
descends down the levels of a hierarchy quantified by h. 
It is also similar to the definition proposed in and jo]. 
The length of the shortest hierarchical path between a 
given pair of nodes can be either: 1) equal to the length 
of the shortest path between these nodes; 2) longer than 
it; 3) not exist at all if these two nodes cannot be con- 
nected by any hierarchical path. The fraction of pairs 
in each of these three categories is denoted as T , S, and 
hi = 1 — T — S correspondingly. Equivalently, the hierar- 
chical fraction T can be viewed as a fraction of shortest 
paths in the network that are hierarchical, while S as 
the probability of finding a non-hierarchical shortcut - a 
path shorter than the shortest hierarchical path between 
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a pair of nodes. 

We analyzed in detail the hierarchical structure of sev- 
eral real- life complex networks: 1) The Internet con- 
sisting of autonomous systems (AS) hardwired to each 
other (6474 nodes, 1572 edges); 2) The largest con- 
nected component of the yeast protein interaction net- 
work @ (2839 nodes, 4220 edges); 3) The largest con- 
nected component of the E-mail mutual correspondence 
network (25151 nodes, 19963 edges) Q; 4) The network 
of CEOs (executive company directors); two directors are 
connected if they both belong to at least one common 
board of directors (6193 nodes, 43077 edges) 01 . To 
have a reference point, we compare these networks with 
their random connected counterparts in which the de- 
gree of every individual node is strictly preserved. In 
practice this is done by multiple edge-rewiring moves 
0, where two links between two randomly selected 
pairs of nodes are rewired with the constraint that one 
only accepts moves in which no double links are created, 
and where the network remains connected. We find that 
the randomized version of the Internet, characterized by 
T r = 0.99, is almost as hierarchical as the real Internet, 
where T = 0.95. The same is true for the E-mail net- 
work {T r = 0.98 vs T = 0.97) and the CEO network 
{T r = 0.84 vs T = 0.78). On the other hand, the ran- 
domized protein interaction network, T r — 0.88, is sig- 
nificantly more hierarchical than the real one, T = 0.33. 
This anti-hierarchical feature of the protein interaction 
network reflects a topology where highly connected nodes 
are placed on the periphery, and not in the center of the 
network Q. 

We also found that a small reduction in T for the In- 
ternet compared to its randomized counterpart is mainly 
due to an increase in the number of non-hierarchical 
shortcuts, S = 0.02 (<5> r =0). This feature is even more 
pronounced for the yeast protein network, with S = 0.17 
{S r = 0.02). One of the possible explanation for this 
phenomena can be the natural tendency toward shorter 
distances and thus toward faster and more specific sig- 
naling. 

As was shown above, a random network with a given 
degree distribution provides a useful benchmark for the 
extent of hierarchy in real- life complex networks. It is 
interesting also to consider the extreme cases: that is to 
construct networks that are the most or alternatively the 
least hierarchical for a given degree distribution. This 
is important for positioning real networks not only with 
the respect to its random counterpart, but also with re- 
spect to the extreme limits the network of a given degree 
distribution can achieve. 

Similar to a randomized version, the maximally hi- 
erarchal version of a network is generated by multiple 
rcwirings of pairs of edges. One has to add however a 
particular preference for reconnection: At each step one 
selects two pairs of connected nodes and attempts to con- 
nect the node with the highest among these four nodes 




FIG. 1: Maximally hierarchical (a), random (b) and maximal 
anti-hierarchical (c) networks of size N — 50 nodes and node 
degree distribution f{k) oc 1/fc 2 ' 5 . 

to the node of the next highest degree in this subset. The 
remaining two nodes are then linked together (multiple 
links are forbidden, and the network should always re- 
main connected). 

The maximally anti-hierarchal version of a network can 
be constructed by the same algorithm but with the oppo- 
site preference of reconnection: the node with the high- 
est degree is linked with that with the lowest degree. In 
Figs. Qi,c we show the maximally hierarchical respect- 
fully anti-hierarchical networks with the same node de- 
gree distribution as a random network, shown in Fig.^D. 
We have found that applying the above algorithm to all 
four empirical networks it is possible to achieve the limits 
where T=l for maximal hierarchy and T rs for maxi- 
mal anti-hierarchy. 

As one can see from Fig. ^ the maximally hierarchical 
or anti-hierarchical networks show strong correlations be- 
tween degrees of connected nodes 1 1 1 1 ] that can be quanti- 
fied through either the correlation profile [3j or the assor- 
tativity measure, r introduced in Ref. |l2| . We consider a 
modified assortativity measure, similar to the one used in 
Ref. [13, but here defined as tad = m ( ((t^)) )' wnere 
(kikj) is the average over all pairs i and j of nearest- 
neighbor nodes in the network and {{kikj)) r is the aver- 
age of {kikj) in an ensemble of randomized networks gen- 
erated as described above 0,0]. We find the maximally 
hierarchical topology to be always assortative {tad > 0), 
while the maximal anti-hierarchical topology - disassor- 
tative {rAD < 0). For example, for a network with the 
node degree distribution f{k) oc 1/fc 2 - 5 , tad = 0.14 and 
— 1.24 for the maximal hierarchy and the anti- hierarchy 
respectively. For comparison, the protein-protein interac- 
tions in yeast, which are well described by f{k) oc 1/fc 2 ' 5 , 
has tad = —0.82. We further stress that assortativity 
and hierarchical topology are in general not prerequisites 
for each other. 

Motivated by the abundance of real-life networks char- 
acterized by a broad, often scale-free, degree distribution 
(as it is the case for the empirical networks we are consid- 
ering here), we quantified the hierarchical fraction ^"asa 
function of the exponent 7 in random scale-free networks 
with a power law degree distribution f{k) oc 1/fc 7 . Such 
networks were constructed by first generating a set of 
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FIG. 2: a) The fraction of shortest paths that are also hi- 
erarchical, T and b) The fraction of non-hierarchical short- 
cuts, <5>, as a function of 7 for three system sizes, N = 
300, 1000, 3000. The shadowed area in a) corresponds to error 
estimates for a network of size N = 3000. 

power-law distributed degrees of individual nodes, then 
linking the edges to create a single-component network, 
and finally randomizing the resulting network using the 
algorithm of Ref. Q , which preserves individual degrees 
and connectedness of the network. 

Fig. |2K shows T vs 7 measured in random scale-free 
networks for different system sizes. The decrease from 
T= 1 to happens as 7 grows from 2 to 3 with a 
smooth transition around 7 ~ 2.6 that weakly depends 
on the system size. We remark that for 7 < 2, T = 1 and 
is nearly independent on the upper cutoff of the degree 
distribution, which is required in this case. 

Fig. shows S vs 7 in random scale- free networks. 
One can see that S — ► as 7 — > 2 and 3. Indeed, as 
7^2 the largest hubs become dominant and the typical 
distance in a network approaches 2 (almost any pair of 
nodes are connected via at least one hub). This makes 
most shortest paths via a hub to be hierarchical. In the 
limit 7^3, the topology of the network is very close 
to a tree, which in its turn implies that the number of 
alternative paths approaches zero, and thus again S — > 0. 

We have seen that as 7 approaches 2 almost all pairs 
of nodes tend to have at least one hierarchical paths con- 
necting them ( J 7 — ► 1 and hence U == 1 — T — S — > 0). 
The existence of hierarchical paths connecting most pairs 
of nodes means that at the very least the majority of 
nodes have at least one neighbor with a degree higher 
then their own. 

Let us first calculate the probability that a given edge 
is attached to a node with degree larger than k 
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Here for 7 < 2 one can only have a scale-free distributions 
below an upper cutoff K. Thus in the absence of degree- 
degree correlations the probability that a node of degree 
k has at least one neighbor of degree higher than itself is 
given by 
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FIG. 3: The probability P(k ne i g hbor > k) for a node of degree 
k to have a neighbor of degree kneighbor > k in an infinitely 
large scale free network, that is not necessarily connected, as 
function of a) - exponent 7 and b) - degree of the node k for 
7 = 2.5, 3 and 4. 



OC 




for 7 < 2 (2) 
for 7 > 2 W 



In Figure we plot this probability of having a boss 
(neighbor with a higher degree) as a function of 7 for 
three different values of the degree k, Fig. and as a 
function of k for different values of 7, Fig. [SJa. For 7 < 2 
both low and high degree nodes always have a higher con- 
nected neighbor and for 7 > 3 the high k nodes nearly 
never have a boss. For 2 < 7 < 3, low connected nodes of- 
ten have no higher connected neighbors (see Fig.^), but 
as Pikneighbor > k) — > 1 for increasing k there is a hierar- 
chical core of highly connected nodes. In popular terms, 
at these intermediate values of 7 many low degree nodes 
escape the hierarchy, while medium and highly connected 
nodes have bosses. Above 7 = 3, Pikneighbor > k) de- 
creases to zero with degree. Thus for these high values of 
7 a network becomes modular with each of the modules 
centered around a local hub. 

Figure^ shows the possible values of T for hierarchies 
and anti-hierarchies for 7 G (2,3). One can see that 
even the networks of narrow degree distribution can be 
organized hierarchically (see upper limit for T for 7 = 3) 
as well as networks of broad degree could be rearranged 
to suppress "self-hierarchical" features (see lower limit 
for T for 7 = 2). 

In Figure 0^ we summarize the results of our study 
of real and random scale-free networks by displaying the 
hierarchical fraction J- observed in real world networks 
(black dots) relative to its value for the random scale-free 
networks with the corresponding value of 7 (solid line). 
As discussed above the Internet, e-mail and CEO net- 
works are about as hierarchical as their random scale- free 
counterparts, while that of protein-protein interactions 
in yeast is significantly anti-hierarchical. Dark shaded 
regions in this figure correspond to the range between 
maximally hierarchical and respectively anti-hierarchical 
networks for a given value of 7. 

Another interesting aspect is the impact the hierar- 
chical structure has on the overall robustness of the net- 
work. In Figure 4b we show the average size of the largest 
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FIG. 4: a) Possible hierarchical organizations of networks, 
dark-shadowed area shows the limits for the average values, 
and light-shadowed area are the corresponding limits includ- 
ing possible variations from sample to sample in a random 
scale-free network with N — 10 3 nodes. The black dots show 
values of the hierarchical fraction T and the degree exponent 
7 for the Internet, E-mail, Yeast Protein Interaction and CEO 
networks discussed in the text. The solid line follows T vs 7 
in random scale-free networks, b) The results of the robust- 
ness analysis for maximally hierarchical (H) , random (R) and 
anti-hierarchical (AH) scale-free networks with 7 = 2.5. The 
effect of the intentional attack by deletion of the single most 
connecting node or edge in the network is reflected in the re- 
duced size Slc of the largest connected component after such 
an attack. The probability Pbreak for a network to break up 
following the removal of a randomly selected node or edge 
shows how sensitive it is to a random, non-intentional failure. 

connected component Slc °f the scale-free network with 
7 = 2.5 and N = 1000 after the intentional attack, con- 
sisting of choosing and removing a single edge (left col- 
umn) or node (right column) in such a way as to min- 
imize the Slc m the resulting network. The smaller is 
the average Slc the more vulnerable is the network with 
respect to attacks. To characterize the robustness of a 
network with respect to random failures we specify the 
likelihood Pbreak that a removal of the single node/edge 
disconnects the network. We find that anti-hierarchical 
topologies are most vulnerable with respect to attacks 
on their edges while hierarchical topologies are sensitive 
to node attacks. Apart from that, hierarchies are most 
vulnerable to random failures. 

In summary, we have discussed hierarchical organiza- 
tion manifested in topology of complex networks, and 
demonstrated how it can be used to characterize possi- 
ble network architectures beyond the degree distribution 
of their nodes. We quantified the hierarchal structure 
as the fraction of shortest paths that are also hierarchi- 
cal. It was found that this quantity approaches its max- 



imum value for marginally divergent scale-free networks 
7 < 2. It was also shown that anti-hierarchy is natu- 
rally related to modular features of networks. Finally 
we found that hierarchal as well as anti-hierarchical net- 
work topologies have implications for signaling and ro- 
bustness against various types of attacks and malfunc- 
tions, with anti-hierarchies being quite reliable against 
the most types of perturbations. 

Acknowledgments: We thank Aspen Center for 
Physics for hospitality. Work at Brookhaven National 
Laboratory was carried out under Contract No. DE- 
AC02-98CH10886, Division of Material Science, U.S. De- 
partment of Energy. 



* Electronic address: trusina@tp.umu.se 
' Electronic address: maslov@bnl.gov 

[1] For a recent collection of articles describing different 
examples of scale-free networks see e.g. "Handbook of 
Graphs and Networks", edited by S. Bornholdt and H. 
G. Schuster, John Wiley and VCH publishers (2002) or 
J. F. F. Mendes, S. N. Dorogovtsev, A. F. Ioffe, "Evolu- 
tion of Networks: From Biological Nets to the Internet 
and WWW", Oxford Press (2003). 

[2] M.E.J. Newman, S.H Strogatz and D.J. Watts, Phys. 
Rev. E 64 026118. M.E.J. Newman, cond-mat/0202208 

(2001) . 

[3] S. Maslov and & K. Sneppen, Science 296, 910-913 

(2002) . 

[4] S. Maslov, K. Sneppen, and A. Zaliznyak, 
cond-mat/0205379 (2002); Physica A 333. 529 (2004). 

[5] H. Tangmunarunkit, R. Govinadan, S. Jamin, S. Shenker, 
and W. Willinger, Tech. Rep. 01-746, Computer Science 
Department, University of Southern California (2001). 

[6] L. Gao, Proc. IEEE INFOCOM, November (2000). 

[7] The dataset collected by the Oregon Views project 
was downloaded from the website of the Na- 
tional Laboratory for Applied Network Research 
(http://moat.nlanr.net/AS/ ). 

[8] Ito, T. et aJ. Proc. Natl. Acad. Sci. USA 98, 4569-4574 
(2001). 

[9] H. Ebel, L. I. Mielsch, and S. Bornholdt, Phys. Rev. E 
66, 035103 (2002). 

[10] G. F. Davis, M. Yoo, and W. E. Baker, preprint, Univer- 
sity of Michigan Business School (2001). 

[11] P. L. Krapivsky, S. Redner, Phys. Rev. E 63, 066123 
(2001). 

[12] M.E.J. Newman, Phys. Rev. Lett. 89, 208701 (2002). 
J. Park and M.E.J. Newman, Phys. Rev. E 68, 026112 

(2003) . 



